PRSeg: A Lightweight Patch Rotate MLP Decoder for Semantic Segmentation
نویسندگان
چکیده
The lightweight MLP-based decoder has become increasingly promising for semantic segmentation. However, the channel-wise MLP cannot expand receptive fields, lacking context modeling capacity, which is critical to In this paper, we propose a parametric-free patch rotate operation reorganize pixels spatially. It first divides feature map into multiple groups and then rotates patches within each group. Based on proposed operation, design novel segmentation network, named PRSeg, includes an off-the-shelf backbone Patch Rotate containing Dynamic Blocks (DPR-Blocks). DPR-Block, fully connected layer performed following Module (PRM) exchange spatial information between pixels. Specifically, in PRM, split reserved part rotated along channel dimension according predicted probability of Channel Selection (DCSM), our only part. Extensive experiments ADE20K, Cityscapes COCO-Stuff 10K datasets prove effectiveness approach. We expect that PRSeg can promote development
منابع مشابه
Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation
Spatial pyramid pooling module or encode-decoder structure are used in deep neural networks for semantic segmentation task. The former networks are able to encode multi-scale contextual information by probing the incoming features with filters or pooling operations at multiple rates and multiple effective fields-of-view, while the latter networks can capture sharper object boundaries by gradual...
متن کاملSegNet: A Deep Convolutional Encoder-Decoder Architecture for Scene Segmentation
We present a novel and practical deep fully convolutional neural network architecture for semantic pixel-wise segmentation termed SegNet. This core trainable segmentation engine consists of an encoder network, a corresponding decoder network followed by a pixel-wise classification layer. The architecture of the encoder network is topologically identical to the 13 convolutional layers in the VGG...
متن کاملGeodesic Patch-Based Segmentation
Label propagation has been shown to be effective in many automatic segmentation applications. However, its reliance on accurate image alignment means that segmentation results can be affected by any registration errors which occur. Patch-based methods relax this dependence by avoiding explicit one-to-one correspondence assumptions between images but are still limited by the search window size. ...
متن کاملRDFa: Lightweight Semantic Enrichment for Hypertext Content
RDFa is a syntactic format that allows RDF triples to be integrated into hypertext content of HTML/XHTML documents. Although a growing number of methods or tools have been designed attempting at generating or digesting RDFa, comparatively little work has been carried out on finding a generic solution for publishing existing RDF data sets with the RDFa serialisation format. This paper proposes a...
متن کاملLightweight Semantic Prototyper for Conceptual Modeling
While much research work was devoted to conceptual model quality validation techniques, most of the existing tools in this domain focus on syntactic quality. Tool support for checking semantic quality (correspondence between the conceptual model and requirements of a domain to be engineered) is largely lacking. This work introduces a lightweight model-driven semantic prototyper to test/validate...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Transactions on Circuits and Systems for Video Technology
سال: 2023
ISSN: ['1051-8215', '1558-2205']
DOI: https://doi.org/10.1109/tcsvt.2023.3271523